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(54) NETWORK MONITORING SYSTEM 

(57)Abstract: 

PROBLEM TO BE SOLVED: To facilitate extension or change by 
enabling daisy chain connection to an interface part on the side of 
each alarming device, receiving alarm output data at one alarming 
device and successively transmitting these data to the other alarming 
devices. 

SOLUTION: When a monitoring device 20 receives a fault report from 
any device 10 to be monitored, concerning each fault, data showing 
the alarm level and required address of fault report are prepared and 
transmitted to an external alarming device 30 according to alarm 
output setting preset to an alarm output setting file. The alarming 
device 30 is equipped with a large network display panel 31 requiring 
the alarm level and the address, alarm lamp 32 and buzzer 33, which 
require only the alarm level, and issues an alarm while preparing data 
common for respective devices. Each IF part 40 has the same 

structure through the daisy chain connection and transmits the sent alarm level and address information as 
it is but at the lamp 32 or the buzzer 33, the information is ignored since no address is required. 
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DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to network monitoring system and the 
network monitoring system with which supervisory equipment outputs an alarm to two or 
more external alarms in response to the advice of a failure from supervisory equipment- 
ed, and tells a failure in more detail. 
[0002] 

[Description of the Prior Art] Network monitoring system receives a message from 
supervisory equipment-ed, such as the electronic automatic exchange which constitutes 
the network which is an object for a monitor, and outputs the information which 
expresses the specific condition of a network for the message. That is, it is constituted so 
that supervisory equipment may output an alarm to an external alarm in response to the 
advice of a failure from supervisory equipment-ed, and generally an alarm display is 
performed based on a message class. 

[0003] Drawing 7 (A) is drawing for explaining this conventional kind of network 
monitoring system, and, as for supervisory equipment-ed and 72, 71 is [ supervisory 
equipment and 73 ] external alarms. Moreover, drawing 7 (B) is the supervisory-control 
section prepared in conventional supervisory equipment 72. if the external alarm 73 
consists of various kinds of supervisory equipment other than a large-sized network 
display panel, such as an alarm lamp and a buzzer, and supervisory equipment 72 
receives advice of a failure from which supervisory equipment-ed ~ the alarm selection 
processing section 721 ~ operating — the content of the advice of a failure — responding - 
- a large-sized display panel, an alarm lamp, and a buzzer - an unit ~ or a multiple 
selection is made and an alarm is performed. Moreover, MJ (measure), MN (minor), and 
NR (Normal) which are a message class, i.e., significance, are determined. In addition, 
generating of a failure with serious MJ, generating of the failure that MN is small, and 
NR mean normal operating status. 

[0004] Since, as for the large-sized display panel of an external alarm, an alarm lamp, and 
a buzzer, interfaces differ, respectively, the alarm selection processing section 721 needs 
to operate each transmit data creation / communications control section 722-724, needs to 
create the transmit data of the individual protocol for every alarm, and needs to transmit 
to each alarm. 
[0005] 

[Problem(s) to be Solved by the Invention] Although the conventional network 
monitoring system is constituted as mentioned above and it operates, supervisory 
equipment will have many which used the workstation and the personal computer, and 
the number of external alarms connectable since the number of ports has a limit will be 
restricted. Therefore, installation etc. needs to become difficult at each ** and the 
operator needs to always be supervising the external alarm in front of an external alarm. ] Vw**' 
[0006] Moreover, since the connection interfaces of each alarm differ, respectively, it is 
necessary to prepare the protocol according to individual, and in duplication and 
modification of an alarm, it is necessary to change the program of supervisory equipment. 
[0007] Moreover, in the conventional network monitoring system, since the above- 



mentioned message classes MJ, MN, and NR are defined uniformly and customize by the 
operator cannot be performed, the period of the waiting under duplication work and for 
failure correction etc. needs to turn off the power source of an alarm for an alarm to be 
outputted and prevent this, also when the operator already consents, but if an alarm is 
turned off, the inconvenience of stopping being able to carry out the alarm also of the 
advice of a failure from other supervisory equipment-ed will arise. 
[0008] Furthermore, supervisory equipment and each alarm cannot be easily 
synchronized, when an inequality may arise in the alarm generation condition of 
supervisory equipment and each alarm and an inequality arises, since power-source 
ON/OFF is independently. That is, since the supervisory equipment of the conventional 
network monitoring system only sent out alarm-output data to each alarm once in 
response to the advice of a failure from supervisory equipment-ed, when which power 
source of an alarm turned off, it had troubles, like the alarm state of the alarm concerned 
becomes supervisory equipment and an inequality. 

[0009] It is made in order that this invention may solve this trouble, the external alarm of 
the required number can be installed, without receiving a limit in the output port of 
supervisory equipment, the duplication can also be performed easily, and customize by 
the operator, such as controlling the alarm from specific supervisory equipment-ed, is 
possible, and it is aimed at offering the network monitoring system which can 
synchronize easily the inequality of the alarm state of supervisory equipment and each 
alarm further. 
[0010] 

[Means for Solving the Problem] The network monitoring system of this invention 
receives the advice of a failure from the supervisory equipment-ed which constitutes a 
network (trap) with supervisory equipment. In the network monitoring system with which 
supervisory equipment transmits alarm-output data to the external alarm which consisted 
of two or more kinds of alarms, and each alarm emits an alarm based on this alarm-output 
data A means to communalize the alarm-output data to each alarm, and to use a common 
interface between supervisory equipment and each alarm, The interface section in which 
daisy chain connection is possible is used for the interface section by the side of each 
alarm. Said alarm-output data which said supervisory equipment transmits are received in 
the interface section of one alarm, and it is characterized by having the means which 
carries out sequential transmission of this at a daisy chain at the interface section of other 
alarms. It becomes unnecessary therefore, to be able to make now easily the duplication 
and modification of each alarm which are prepared in an external alarm, and to prepare 
the protocol according to individual. 

[001 1] Moreover, supervisory equipment receives the advice of a failure from the 
supervisory equipment-ed which constitutes a network (trap). In the network monitoring 
system with which supervisory equipment transmits alarm-output data to two or more 
external alarms which consisted of two or more kinds of alarms, and each alarm of each 
external alarm emits an alarm based on this alarm-output data A means to communalize 
the alarm-output data to each alarm, and to use a common interface between supervisory 
equipment and each external alarm, The interface section in which daisy chain 
connection is possible is used for the interface section by the side of each external alarm. 
Said alarm-output data which said supervisory equipment transmits are received in the 
interface section of one external alarm, and it is characterized by having the means which 



carries out sequential transmission of this at a daisy chain at the interface section of other 
external alarms. It becomes unnecessary therefore, to be able to extend an external alarm 
now easily and to prepare the protocol according to each alarm individual. 
[0012] Said supervisory equipment is equipped with a storage means. Moreover, for said 
storage means The failure generating status file which stores the advice of a failure from 
said supervisory equipment-ed, The alarm-output configuration file which sets up the 
alarm level and the address required for a specific alarm when performing [ whether an 
alarm output is performed to said advice of a failure, and ] an alarm output, It is 
characterized by preparing the alarm-output status file which memorizes serially said 
alarm-output data created from the content of said failure transaction file, and the content 
of said alarm-output configuration file. 

[0013] Moreover, it is characterized by having equipped said supervisory equipment with 
the interactive I/O device, and having a means to use said interactive I/O device, and to 
set up and change the content of said alarm-output configuration file. Therefore, the 
inconvenience by the operator ~ become customizable and an alarm continues being 
emitted at the period under duplication work is cancelable. 
[0014] Furthermore, it is characterized by having a means to make all the contents 
memorized to said alarm state file output to said supervisory equipment using said 
interactive I/O device, and to attain synchronization with said supervisory equipment and 
said each alarm. Therefore, the inequality produced when the power source of an alarm is 
turned off is easily cancelable. 
[0015] 

[Embodiment of the Invention] Hereafter, the operation gestalt of this invention is 
explained using a drawing. Drawing 1 is the block diagram showing 1 operation gestalt 
of the equipment configuration of the network monitoring system of this invention, and, 
as for supervisory equipment-ed and 20, 10 is [ supervisory equipment and 30 ] external 
alarms in drawing. As for the supervisory equipment 10-ed connected, two or more 
supervisory equipment-ed of 1 1-13 exists. Moreover, storage and an interactive I/O 
device are formed in supervisory equipment 20 (not shown [ both ]), and as shown in 
storage at drawing 2 , the failure generating status file 201, the alarm-output 
configuration file 202, and the alarm-output status file 203 are stored in it. Moreover, the 
external alarm 30 consists of each alarm of the large-sized network display panel 31, an 
alarm lamp 32, and buzzer 33 grade, and the interface (IF) section 40 is formed in each 
alarm at each. Drawing 3 is the block diagram showing the configuration of the IF section 
40, it is the gate (gate which consisted of software actually) which 401 carries out 
transmission and reception of the data of each IF section 40 in the relay circuit for a 
bypass, and carries out AND of 402 in the unit of transmission and reception, 
respectively, and it is constituted so that daisy chain connection of two or more IF 
sections 40 of a base may be made by this. 

[0016] If supervisory equipment 20 receives the advice of a failure (trap) from which 
supervisory equipment 10-ed, to each trap, according to alarm-output setting out 
beforehand set as the alarm-output configuration file (202 of drawing 2 ), the alarm levels 
MJ (measure), MN (minor), and NR (Normal) of advice of a failure and the data of the 
required address (for example, address of into which part of a panel to perform an alarm 
display) will be created, and it will transmit to the external alarm 30. Although there is 
each alarm of an alarm lamp 32 and a buzzer 33 which needs only an alarm level, and the 



large-sized network display panel 3 1 which needs the address and an alarm level among 
the external alarms 30, with this operation gestalt, data common to each alarm are created 
and an alarm is generated. Since each IF section 40 is the same configuration, the alarm 
level and address information which have been sent are transmitted as it is also to an 
alarm lamp 32 and a buzzer 33, but at an alarm lamp 32 or a buzzer 33, since the address 
is not needed, it ignores. 

[0017] Moreover, as shown in drawing 3 , supervisory equipment 20 and each IF section 
40 are connected using the RS-232C connector and the cable, and a relay switch (SW1, 
SW2) operates so that the relay circuit 401 for a bypass established in the IF section 40 
may constitute a bypass circuit, if the power source OFF of the IF section 40 is detected. 
Moreover, in the IF section 40, AND of transmission and the received data of each IF 
section 40 is carried out by the gate 402 in the unit of transmission and reception, and as 
shown in drawing 1 , daisy chain connection of two or more IF sections 40 is made. 
[0018] Next, the content of each file stored in the storage of the supervisory equipment 
20 of drawing 2 is explained. The failure generating status file 201 stores whether it is 
under [ failure generating ] ****** about all the failures notified by the trap. A failure 
generating condition memorizes the failure notified by the trap, setting out and the 
comment input of the address information at the time of an operator displaying the alarm- 
output configuration file 202 on setting out of ON/OFF (/an alarm output is carried out - 
don't carry out) of an alarm output performed by using an interactive I/O device, setting 
out of alarm level (significance) MJ/MN/NR, and a large-sized display panel according to 
advice of a failure are memorized. These setting out consists of these operation gestalten 
so that an operator may use the keyboard and mouse of an interactive I/O device and can 
set up and change as desired. The alarm-output status file 203 is a file which memorizes 
serially the alarm-output data created and outputted with supervisory equipment 20 from 
the content of the failure transaction file 201, and the content of the alarm-output 
configuration file 202, and the alarm-output condition of the alarm level under all address 
part and current generation is memorized. 

[0019] Drawing 4 (A) is a flow chart which shows actuation when supervisory equipment 
20 receives the advice of a failure (trap). Supervisory equipment 20 will update the 
content of the failure generating status file 201 based on the failure notified by the trap, if 
a trap is received. Next, with reference to the alarm-output configuration file 202, if 
alarm-output setting out is ON, supervisory equipment 20 will carry out renewal of 
record of MJ/MN/NR at the alarm-output status file 203 according to an alarm level, and 
will output an alarm level (MJ/MN/NR) and address information to the external alarm 30. 
Moreover, when alarm-output setting out of an alarm-output configuration file is OFF, 
renewal of the alarm-output status file 203 is not performed, and the output to the 
external alarm 30 is not performed, either. 

[0020] Drawing 4 (B) is a flow chart which shows alarm-output setting-out modification 
processing and alarm synchronization processing. In alarm-output setting-out 
modification processing, using the keyboard and mouse of an interactive I/O device, the 
content of the alarm-output configuration file 202 is changed, and an operator updates the 
content of the alarm-output status file 203 according to alarm-output setting out which 
read the alarm-output status file 203 to the interactive I/O device, and changed it 
continuously, and transmits all the information from this file 203 to the external alarm 30. 
[0021] Next, alarm synchronization processing is explained. Although it is possible to 



perform alarm synchronization processing from both by the side of supervisory 
equipment 20 and the external alarm 30, in carrying out from a supervisory equipment 20 
side, it outputs an alarm synchronization processing demand command to supervisory 
equipment 20 using the keyboard and mouse of an interactive I/O device. Supervisory 
equipment 20 transmits all the information that receives this command and is in the 
alarm-output status file 203 to the external alarm 30, and is made in agreement with the 
current condition that supervisory equipment 20 has the condition of all the display 
panels of each alarm, a lamp, and a buzzer. That is, the current condition that supervisory 
equipment 20 has the alarm which the power source is turned off and became an 
inequality is synchronized. Moreover, in outputting an alarm synchronization processing 
demand command from the external alarm 30 side to supervisory equipment 20, an alarm 
side transmits in the procedure in consideration of the collision of synchronous request of 
a slave side in the slave side of the multidrop of RS232C (using polling, a selecting 
method, etc.). 

[0022] Next, the example of this invention is explained using drawing 1 and drawing 2 . 
For example, when "xxx basic trunk failure" occurs with supervisory equipment 10-ed, 
"xxx basic trunk failure generating" is notified to supervisory equipment 20 by the advice 
of a failure (trap). Supervisory equipment 20 stores "xxx basic trunk failure generating" 
in the failure generating status file 201, if this trap is received. And the record which 
corresponds to xxx basic trunk failure with reference to the alarm-output configuration 
file 201 is searched. 

[0023] Supervisory equipment 20 records the print-out outputted to the external alarm 30 
on the alarm state output file 203 according to setting out of the output of this record, an 
alarm level, and the address, when the record applicable to xxx basic trunk failure is 
found. In the alarm-output configuration file 202, ON and an alarm level create the data 
(MJ generating / MN-less/NR nothing and xxx) as information which outputs them to the 
external alarm 30 since MJ (major) and the address are xxx(es), and an output transmits. 
Each external alarm 30 receives this data (MJ generating, xxx), with each alarm, is 
displayed and carries out singing of the alarm. 

[0024] Next, actuation of each alarm is explained. With the large-sized network display 
panel 31, data (MJ generating, xxx) are received and the circuit which the address of xxx 
shows is displayed in MJ color (for example, red). Moreover, in an alarm lamp 32, data 
(MJ generating, xxx) are received and the lamp of MJ color (for example, red) is turned 
on (since it is unnecessary, the address is disregarded). Furthermore, at a buzzer 33, data 
(MJ generating, xxx) are received and singing of the buzzer is carried out by MJ (it is 
continuation singing until it restores for example, with the maximum sound volume) 
(since it is unnecessary, the address is disregarded). Moreover, in another room (another 
floor, separate building), since the same external alarm 30 by which daisy chain 
connection was made between the IF sections 40 is installed and each alarm can receive 
data (MJ generating, xxx), alarm actuation is performed similarly. 
[0025] Next, for the reason under work etc., the actuation which stops / resumes xxx 
basic trunk failure alarm under present output is explained using drawing 2 and drawing 4 
. An operator changes the output of xxx basic trunk failure of the alarm-output 
configuration file 202 of drawing 2 from ON at OFF using the keyboard and mouse of an 
interactive I/O device. Moreover, the comment of a purport under construction now is 
added if needed. According to setting-out modification of this output, from the failure 



generating status file 201 and the alarm-output configuration file 202, supervisory 
equipment 20 updates the alarm state of the address of the alarm-output status file 203 
concerned, and transmits it to the external alarm 30. 

[0026] Since it not only restores the alarm state of the address of the alarm-output status 
file 203 concerned, but it may be carrying out the alarm display including two or more 
fault conditions in the one address like for example, bbb equipment failure (address bbb) 
and ccc card failure (address bbb) when restoring alarm cutoff, it is necessary to judge 
again the fault condition generated by then in the corresponding address, and to update 
the alarm-output status file 203. By the same approach, only not only in setting-out 
modification of the output of an alarm, and the alarm level of the alarm-output 
configuration file 202 and setting-out modification of the address can be made (in the 
flow of alarm-output modification, it is supposed that simple is carried out and all alarms 
are outputted again). 

[0027] Finally synchronization processing is explained. In this case, all the contents of 
the alarm-output status file 203 are outputted, and it is made in agreement with the 
current condition that supervisory equipment has the condition of all the display panels of 
each alarm, a lamp, and a buzzer. Although it is not easy to become supervisory 
equipment 20 and an inequality and to synchronize this when the power source is turned 
off for any of an alarm they are at the time of the data transmission since it does not have 
a means to memorize the transmitted alarm-output data in the conventional network 
monitoring system , with this operation gestalt , it can synchronize easily by make all the 
contents of the alarm-output status file 203 output . 

[0028] Drawing 5 is the block diagram showing other operation gestalten of this 
invention. Although considered as the configuration which forms the IF section 40 in the 
external alarm 30 with the operation gestalt shown in drawing 1 , it is good also as a 
configuration which makes the IF section 40 become independent of the external alarm 
30 as shown in drawing 5 . 

[0029] Moreover, drawing 6 is the block diagram showing the operation gestalt of further 
others of this invention. Although considered as the configuration which forms the IF 
section 40 in each alarm, respectively with the operation gestalt shown in drawing 1 , it is 
good for the external storage 30 installed one set at a time in each ** (a floor, ridge) by 
making the number of connectors of the IF section 40 into plurality also as a 
configuration which forms one IF section 40, respectively. 
[0030] 

[Effect of the Invention] As explained above, the network monitoring system of this 
invention communalizes the alarm-output data to each alarm, and since [ the interface 
section ] daisy chain connection is made, a system, such as duplication and modification 
of an alarm, and forming an external alarm in two or more rooms further, can be extended 
easily. It becomes unnecessary moreover, for supervisory equipment to create alarm- 
output data for every alarm. Moreover, an operator can control an alarm by changing the 
content of the alarm-output configuration file according to an individual, and 
inconvenience, such as being continued by emitting this an alarm etc. during a 
duplication period, can be canceled. Since it has a means to make all the contents 
furthermore memorized to the alarm state file output, and to attain synchronization with 
supervisory equipment and each alarm, there is effectiveness that the inequality of the 
alarm state of an alarm is easily cancelable etc. 



CLAIMS 



[Claim(s)] 

[Claim 1] Supervisory equipment receives the advice of a failure from the supervisory 
equipment-ed which constitutes a network (trap). In the network monitoring system with 
which supervisory equipment transmits alarm-output data to the external alarm which 
consisted of two or more kinds of alarms, and each alarm emits an alarm based on this 
alarm-output data A means to communalize the alarm-output data to each alarm, and to 
use a common interface between supervisory equipment and each alarm, The interface 
section in which daisy chain connection is possible is used for the interface section by the 
side of each alarm. Network monitoring system characterized by having received said 
alarm-output data which said supervisory equipment transmits in the interface section of 
one alarm, and having the means which carries out sequential transmission of this at a 
daisy chain at the interface section of other alarms. 

[Claim 2] Supervisory equipment receives the advice of a failure from the supervisory 
equipment-ed which constitutes a network (trap). In the network monitoring system with 
which supervisory equipment transmits alarm-output data to two or more external alarms 
which consisted of two or more kinds of alarms, and each alarm of each external alarm 
emits an alarm based on this alarm-output data A means to communalize the alarm- 
output data to each alarm, and to use a common interface between supervisory equipment 
and each external alarm, The interface section in which daisy chain connection is possible 
is used for the interface section by the side of each external alarm. Network monitoring 
system characterized by having received said alarm-output data which said supervisory 
equipment transmits in the interface section of one external alarm, and having the means 
which carries out sequential transmission of this at a daisy chain at the interface section 
of other external alarms. 

[Claim 3] Network monitoring system according to claim 1 or 2 characterized by 
providing the following The failure generating status file which equips said supervisory 
equipment with a storage means, and stores the advice of a failure from said supervisory 
equipment-ed in said storage means The alarm-output configuration file which sets up the 
alarm level and the address required for a specific alarm when performing [ whether an 
alarm output is performed to said advice of a failure, and ] an alarm output The alarm- 
output status file which memorizes serially said alarm-output data created from the 
content of said failure transaction file, and the content of said alarm-output configuration 
file 

[Claim 4] Network monitoring system according to claim 3 characterized by having 
equipped said supervisory equipment with the interactive I/O device, and having a means 
to use said interactive I/O device, and to set up and change the content of said alarm- 
output configuration file. 

[Claim 5] Network monitoring system according to claim 3 characterized by having a 
means to make all the contents memorized to said alarm state file output to said 
supervisory equipment using said interactive I/O device, and to attain synchronization 



with said supervisory equipment and said each alarm. 



[Translation done.] 



METHOD AND SYSTEM FOR TESTING THE VALIDITY OF SHARED DATA IN A 

MULTIPROCESSING SYSTEM 

BACKGROUND OF THE INVENTION 

[0001] The present disclosure relates generally to a method and apparatus for testing 

the validity of shared data in a multiprocessing system, and particularly to a method and 
apparatus for random testing data block concurrency, execution sequence and serialization in 
multiprocessing systems in response to pseudo-random instruction streams. 

[0002] In a typical multiprocessing environment, a number of central processing units 

(CPUs) share data residing in a particular location of the main/cache memory. These CPUs 
may be required to obey certain rules such as block concurrence, execution order and 
serialization. The block concurrence rule states that while a given CPU is fetching from or 
storing into a block of shared memory, the operation must be done in indivisible units. For 
example, if a fetch unit is four bytes, then the CPU is not allowed to fetch the data in two or one 
byte increments. The execution order rule requires that all stores and fetches are executed in 
order. In addition, the serialization rule refers to the ability of the CPU to use pre-fetched data 
until the CPU serializes, at which point it discards all pre-fetched data and completes all 
pending stores before it executes new fetches. 

[0003] An access by an instruction to a memory location may be a fetch, a store or an 

update. Tn the case of the fetch access, the instruction does not alter the contents of the 
memory location. In contrast, the store access type instruction modifies the contents of the 
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memory location. An instruction that fetches from and then stores into the same memory 
location performs an update access type. An update access operation can be viewed as a fetch 
operation followed by a store operation. 

[0004] Several random test methodologies regarding cache coherence of the weakly- 

ordered (e.g., Reduced Instruction Set Computer (RISC)) architecture, such as PowerPC, 
have been reported in the literature. In the data-coloring technique, each store operation writes 
unique data into the common memory location. In this method if instructions storing into the 
shared locations are allowed to access a storage location consisting of a combination of shared 
and non-shared locations, one has to account for the worst case and thus limit store operations 
affecting these shared locations to 256 (maximum number of distinct data colors that can be 
represented with one byte). Therefore, the number of instructions of a given CPU that could 
write into the shared location becomes severely limited as the number of CPUs increases. The 
use of monotonic input data for multiprocessors and analyzing the terminal results is a special 
case of the data coloring technique. The false sharing method partitions the shared memory 
location such that different CPUs store into and fetch from non-overlapping locations. Other 
approaches force synchronization instructions before each store operation that stores into the 
shared location. However, these techniques concentrate on weakly-ordered architectures such 
as PowerPC and do not consider when arithmetic and logic operations engage in register-to- 
memory, memory-to-register, register-to-memory or memory-to-memory data transfer 
operations. Furthermore, the fact that an instruction stream and translation tables can be 
subject to modifications due to the actions of some CPUs are not taken into account. 

[0005] In the case of a firmly-ordered architecture (e.g., Complex Instruction Set 

Computer (CISC)), such as IBM's System 390 architecture, a store operation executed by an 
instruction can take place (as seen by other CPUs) only after all the preceding store operations 
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are completed. Furthermore, both logical and arithmetic operations of the IBM System 390 
architecture can involve register-to-memory, memory-to-register and memory-to-memory 
operations. However, unless another CPU observes, one cannot conclude that any of the store 
operations of a given CPU are completed. Hence, determining the validity of the fetched data 
from a memory location is complicated when more than one CPU may have stored data into 
that location. Data fetched by a CPU,- from a memory location M which has been subject to a 
number of store operations by different CPUs may not adhere to certain rules such as the block 
concurrence, execution order and serialization rules. 

SUMMARY OF THE INVENTION 

[0006] In one embodiment, a method for testing the validity of shared data in a 

multiprocessing system is disclosed. The method comprises receiving at a first central 
processing unit a list of fetch and store instructions associated with blocks in a shared memory 
location. The list includes a data value, a central processing unit identifier and a relative order 
associated with the instructions. In addition, one of the data values associated with one of the 
instructions was stored by a memory-to-memory, memory-to-register or register-to-memory 
operation. Further, one of the central processing unit identifiers associated with one of the 
instructions is an identifier corresponding to one of a plurality of central processing units that 
have access to the shared memory location including the first central processing unit. A fetch 
operation is performed at a block in the shared memory location from the first central 
processing unit. Fetched data is received at the first central processing unit in response to the 
performing, where the fetched data was stored by one of the plurality of central processing 
units. The method verifies that the fetched data conforms to a block concurrency rule in 
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response to the list and that the fetched data conforms to a serialization rule in response to the 
list. 

[0007] In another embodiment, a computer program product for testing the validity of 

shared data in a multiprocessing system is disclosed. The computer program product comprises 
a storage medium readable by a processing circuit and storing instructions for execution by the 
processing circuit for performing a method. The method comprises receiving at a first central 
processing unit a list of fetch and store instructions associated with blocks in a shared memory 
location. The list includes a data value, a central processing unit identifier and a relative order 
associated with the instructions. In addition, one of the' data values associated with one of the 
instructions was stored by a memory-to-memory, memory-to-register or register-to-memory 
operation. Further, one of the central processing unit identifiers associated with one of the 
instructions is an identifier corresponding to one of a plurality of central processing units that 
have access to the shared memory location including the first central processing unit. A fetch 
operation is performed at a block in the shared memory location from the first central processing 
unit. Fetched data is received at the first central processing unit in response to the performing, 
where the fetched data was stored by one of the plurality of central processing units. The 
method verifies that the fetched data conforms to a block concurrency rule in response to the list 
and that the fetched data conforms to a serialization rule in response to the list. 

[0008] In a further embodiment, a system for testing the validity of shared data in a 

multiprocessing system is disclosed. The system comprises a plurality of central processing units 
including a first central processing unit and a shared memory location accessed by the plurality of 
central processing units. The system further comprises application software located on the first 
central processing unit for implementing a method comprising receiving at the first central 
processing unit a list of fetch and store instructions associated with blocks in the shared memory 
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location. The list includes a data value, a central processing unit identifier and a relative order 
associated with the instructions. In addition, one of the data values associated with one of the 
instructions was stored by a memory-to-memory, memory-to-register or register-to-memory 
operation. Further, one of the central processing unit identifiers associated with one of the 
instructions is an identifier corresponding to one of the plurality of central processing units that 
have access to the shared memory location including the first central processing unit. A fetch 
operation is performed at a block in the shared memory location from the first central processing 
unit. Fetched data is received at the first central processing unit in response to the performing, 
where the fetched data was stored by one of the plurality of central processing units. The 
method verifies that the fetched data conforms to a block concurrency rule in response to the list 
and that the fetched data conforms to a serialization rule in response to the list. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10009] Referring to the exemplary drawings wherein like elements are numbered alike in 

the accompanying Figures: 

[0010] FIG. 1 is a block diagram of a high level process flow for an exemplary 

embodiment of the present invention; 

[0011] FIG. 2 is an exemplary embodiment of an algorithm that may be utilized to test 

for block concurrence; 

[0012] FIG. 3 is an exemplary embodiment of an algorithm for testing machine 

compliance to serialization rules; and 
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[0013] FIG. 4 is a block diagram of an exemplary system for testing the validity of 

shared data in a multiprocessing system. 

DETAILED DESCRIPTION OF THE INVENTION 

[0014] An exemplary embodiment of the present invention provides a method of testing 

the validity of data fetched by a given CPU from storage locations that are subject to store 
and/or update operations by multiple CPUs. This includes both load and store operations as 
well as arithmetic and logical operations. The conformance of a machine to the block 
concurrence, order of execution and serialization rules is verified. In an exemplary embodiment 
of the present invention, it is assumed that prior to the analysis, the order in which the CPUs 
access the shared location is not known (i.e., time relationship among the CPUs is not available). 
FIG. 1 is a block diagram of a high level process flow for an exemplary embodiment of the 
present invention. The high level process flow depicts an exemplary method for testing of the 
validity of shared data in a multiprocessing system. At step 102, the shared memory locations 
are identified. At step 104, a pseudo-random test instruction stream is built for each CPU. 
These pseudo-random streams include instructions that fetch and store into the shared memory 
locations. A table of shared information (TOSHI) is built at step 1 04. When an instruction is 
selected to access a shared location, a TOSHI entry is created for that instruction. A TOSHI 
entry includes information such as the instruction address, the target address of the data, the 
source address of the data, any registers involved in the shared location and any associated flags 
(e.g., an execution flag). It is assumed that instructions are executed by their respective CPUs 
and not in a simulation environment. If simulation is available the TOSHI will contain less 
information. 
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[0015] In an embodiment of the present invention, a single operation accessing a shared 

location may simultaneously write to or read from a combination of shared and non-shared 
locations. For example, if the instruction stores four bytes into the memory, the following 
combinations are possible: all four bytes are part of a shared location; three bytes are part of a 
shared location and the other byte is part of a non-shared location; two bytes are part of a 
shared location and the other two bytes are part of a non-shared location; one byte is part of a 
shared location and the other three bytes are part of a non-shared location; or all four bytes arc 
from a non-shared location. In other words, an embodiment of the present invention allows 
processors to share up to the bit level. In addition, in an embodiment of the present invention, 
when an instruction fetches data from a shared location, the data is protected (i.e., is not 
overwritten) until it is made observable by another store operation. For example, if instruction 
*T fetches from a shared location "B" and stores the data into "A", where "A" is a register, the 
register is marked (via TOSHI flags) that it contains data that needs to be observed. Therefore, 
until another store-type instruction uses the register and stores into a non-shared location, no 
instruction can overwrite the data contained by this register. To maintain the randomness of the 
instruction stream, a number of randomly selected instructions is inserted into the stream 
between the point where an instruction fetches from a shared location and when the fetched data 
is made observable. 

[0016] At step 106, the stream of instructions are passed to their respective CPUs. 

The streams are then executed by their respective CPUs. New pseudo random streams of 
instructions are built and passed to the CPUs if no error is detected in the previous streams. At 
step 108, relevant data information is gathered from the execution as directed by the entries in 
the TOSHI. In an exemplary embodiment of the present invention, the relevant data structure 
extracted from TOSHI includes lists containing the data stored by each CPU to each shared 
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memory location and the addresses of instructions that are storing into or fetching from the 
shared location, the addresses of the source or observable (i.e., known or resultant) data, and 
those tnstructions'that cause serialization. At step 110, conformance to block concurrency rules 
is verified. The block concurrency rule checks that a CPU stores and fetches in indivisible units 
(e.g., half-word, double-word). In addition, at step 1 10, conformance to the execution order 
rules is verified. Execution rules refer to the requirement that all stores and fetches to the shared 
memory location should be performed in order. Conformance to serialization rules is verified at 
step 112. Serialization occurs when a CPU is required to discard all pre-fetched data and 
completes pending data stores. An embodiment of the present invention verifies that a CPU has 
performed the serialization and has discarded all pre-fetched data. At step 1 14, the process, 
starting at step 102, is repeated if there was no error in steps 1 10 and 112. If there was an 
error in step 1 10 or step 1 1 2, the process depicted in FIG. 1 is exited with an error message. 

Fetch/Store Operations and Block Concurrency Verification 

[0017] In this section, an embodiment of the present invention that includes the ability to 

verify the block concurrence of memory locations, altered by register-to-memory and memory- 
to-memory store operations (e.g., IBM S/390 rx and ss-format y respectively) of different CPUs 
is described. Let M be a memory location which may be altered by a number of CPUs. The 
goal is to test if data fetched by CPU, from M is valid. It is assumed that immediately after a 
CPU stores into M all other CPUs can reference M and use the new value. A limitation on the 
observability of M is due to the fact that the contents of M can be observed only when a given 
CPU accesses M by executing a fetch-type instruction. For each CPU f that stores into M, a set 
or list L CPUi (M) a containing the stored data by the CPU is created. Once a particular CPU, 
fetches data D from M f an effort is made to determine: 
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1 . which CPU has stored the fetched data; and 

2. if the search in step 1 above is successful, then the uniqueness of the data (i.e., 

whether it was stored by a single CPU*) is checked. If the data is found to be 
unique, an attempt is made to reduce the set L C puk- 

In other words, the existence of D in all L CPU (M)s is investigated. An invalid data D has 
been fetched if D is not found in these sets. On the other hand, if D is a member of one or more 
of the L CPU (M)s, determining the CPU that stored D is performed next. In an exemplary 
embodiment of the present invention, the way that such a CPU can be identified is if D exists 
only in a single L CPl /M). If it is determined that CPU, stored D, then L CPUi is reduced, if 
possible, by removing all entries that have been stored by CPU/ prior to D from the set. Such 
reduction can be performed since all preceding store operations of CPU, must have been 
performed prior to storing D (conceptual sequence). The resulting list, L CPUif is then accessed to 
evaluate the data from the next fetch instruction that includes data stored by CPU,-. 

[0018] An exemplary embodiment of an algorithm that may be utilized to test block 

concurrence is shown in FIG. 2. Note that the size of M depends upon the instruction being 
executed and is an integral of the specified block concurrence. At step 202 v in the algorithm, a 
loop is initiated to retrieve instructions, denoted as "I" in FIG. 2, from the stream, denoted as 
"S" in FIG. 2. The stream includes a sequence of any number of instructions and may include 
instructions that access the shared locations. During stream execution, the following information 
is collected: instructions fetching from or storing into shared locations, the data value being 
stored, the data value being stored/fetched (if known), a CPU identifier corresponding to the 
CPU performing the instruction and the address of the instruction relative to other instructions in 
the stream. For each fetch instruction located, as tested at step 204, the "FOUND" flag is set to 
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false, the "ValidData" flag is set to false and the 'TJniqueData" flag is set to true. Each fetch 
instruction also includes a data value, "Z>", that was fetched. A loop, starting at step 206, is 
performed for each CPU that utilizes the shared data location, starting with the first CPU, while 
the "UniqueData" flag is still true. If the "UniqueData" flag is set to false (i.e., the data value U Z>' 
has been stored by more than one CPU), then the algorithm depicted in FIG. 2 continues to 
analyze the next instruction. If the data, "Z>", is located in the list of stores associated with the 
current CPU being searched, L CFlJh then at step 208, the "ValidData" flag is set to true. This 
means that the data being verified, "D" has been located and identified as being stored on one of 
the CPUs, and that "D" is valid data. Next, another loop is performed at step 210 to determine 
if the data, "D", is also located in a list associated with another CPU. If this is the case, then the 
"UniqueData" flag is set to false. Otherwise, at step 212, the list associated with the CPU where 
the data, "D" was located is reduced based on the position of "Z>" in the list. All data store 
entries prior to "D" in the list i CPW are removed. If the data is not located in one of the CPU 
lists, then the "ERROR" flag is set to true, indicating that the data is invalid and that the block 
concurrence rules were violated. 

[0019] For example, assume that CPU/ and CPU 2 store into shared data storage, M f 

the following data values, or "D s". 



Lcpvi 



a 



a 



b 



a 



c 



a 



c 



b 



d 



g 



e 
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f 

For purposes of this example, also assume that C?U 3 repeatedly executes fetch operations and 
retrieves "a", "c", "b" "g" and "f ' from the shared location. When CPU, fetches "a", the data 
fetched may have came from either CPU, or CPU,. Therefore, although L CPUJ and L CPU2 
cannot he reduced, the data is determined to be valid. When CPUj fetches "c", the fact that 
CPU 7 is the only one that stored "c" into M implies that "c" is valid data. In addition, the data 
values "a" and "b", stored by CPU,, are overwritten and are no longer valid data values. 
Therefore, the lists are reduced as follows. 



Lcpui 


Lcpui 


c 


a 


c 


a 


d 


a 


e 


b 


f 


g 



[0020] Similarly, when CPUj reads "b", the sequence of "a's" stored by CPU 2 is 

overwritten and cannot be read. The resulting lists of data are as follows. 

Lcpui Lcpui 

c b v 

c g 
d 



POU920020I25USI 



11 



[0021] When CPU, retrieves "g", the following lists of data result after 'V, which has 

been overwritten by CPU, is deleted from the list. 

Lcpui Lcpui 

c g 

c 

d 

e 

f 

[0022) When CPU, reads "f \ then V, "d" and "e" are no longer valid data values to 

be fetched because they have been overwritten by CPU,, and the lists of expected data are 
reduced to the following. 

Lcpui Lcpw 
f g 
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[0023] In another example, utilizing an exemplary embodiment of the present invention, 

it is assumed that the store operations of CPU; and CPU? happen in a particular order . In this 
example, CPU/ and CPU, execute the following store operations. 

CPU , CPU, 
M-d 

M-c. 

M-d 

[0024] Further, in this example, CPUj accesses M by executing a sequence of fetch 

operations. The lists of data observable by for CPUj are as follows. 

Lcpui Lcpm 
b d 
a c 
d 

[0025] First, suppose that CPUj fetches "d" Since "d" is stored by both CPU/ and 

CPU 2 (i.e., it is not unique data), the list of observable data does not get reduced. Next, if 
CPUj reads M c" it can be concluded that the fetched data is both valid and unique because "c" 
appears in only one list. The "d" stored by CPU? will never be fetched again by CPUj as stated 
by the conceptual sequence rules. Therefore, the lists of data that may be observed by CPUj are 
reduced as depicted below. 

• Lcpui Lcpuj 
POU920020125US* 
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be 

a 

d 

[0026] Next, in this example, CPUj retrieves "d". At this time "d" is both valid and 

unique data. Fetching "d M implies that CPU, will not be able to observe "c" since all fetches 
must in order. Also, CPU, will not be able to fetch the "b" and "a" stored by CPU, again and 
the lists of observable data for CPU 3 are reduced to the following. 

LcVUt LCPV2 

• d 



[0027] Next, is an example where the block concurrency is violated. Suppose the list 

of CPUj andCPU 2 stored data is the following. 

( 

Lcpui Lcpui 
abed ijkl 
efgh 

A block concurrency/execution order violation would occur, for example, if CPU 3 fetches "ijgh". 
Arilhmetic/Loeical Operations and Block Concurrence 

[0028] An exemplary embodiment of the present invention checks the conformance of logical 
and arithmetic operations, that fetch data from shared memory locations, to the block 
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concurrence rules. Such operations are in the form of "A - A <op> B" where "<op>" is a 
logical or arithmetic operator, "A" is retrieved from and then stored into a register, and "B" is 
fetched from a shared memory location considered. When B is from a shared memory location, 
the method creates observable points such that the value loaded into A is available for future 
analysis. Since both the old and final values of A are known, an exemplary embodiment of the 
present invention attempts to determine some or all of the bits of B. The method uses reverse 
operations to find some or all of the bits fetched from the shared locations. Although a strong 
conclusion cannot be made just by knowing only a portion of B's bits, such portion of the data 
may be utilized to deduce whether a given CPU "possibly" stored B. The search is performed 
as described in the previous section except that certain bits of B may not be known. As a result, 
decisions are made based on possibilities. 

[0029] In the following example for determining all or a portion of the bits of B, B is 

fetched from a shared memory location and A is retrieved from and then stored into a register. 
The bits of A are referred to as "a" and the bits of B are referred to as"f\". Also, in the 
example, "az o/rf " is the value of a, before the store operation is executed and "ai nw T is ^ valuc 
of a, after the store is completed. Exemplary calculations for determining all or a portion of the 
bits in B are described below for logical operations and arithmetic operations. 

To xical Operations 

XQR 

bi = ai new XORai old 

AND 
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case 1: 

(ai new = 1) AND(a/ oM =l)=>6 ( = l 

case 2: 

(ai new - 0) AND (ai M = 1) => b, = 0 

case 3: 

(<IW= 0) AND = 0) => 6, = u 

case 4: 

(a/^ = 1) AND {ai oli = 0) => b-, = u; 
where "u" denotes unknown. 

OR 

<"«w *~ ai oid OR 
case 1: 

{ai^ = 0) AND K w = 0) => 6, = 0 

case 2: 

K w = I) AND (ai M = 0) => 6, = 1 

case 3: 

(ai new = 1) AND = 1) => 6, = u 

case 4: 

(a/ new = 0) AND K„ - 1) => b, = u 
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Arithmetic Operations 

Logical Addition 

B — ' ^oWj 

where X )ew is the value of A after the store is completed and A aId is the 
value of A before the store is executed 

Logical Subtraction: 

Anew ~ ^old ' B 

B^A old + A new 
Multiplication: 

The product of multiplying A and B, where both are n bit operands, is 
placed in a field, "P" of 2 *n bits. The most significant bit of the product 
represents the sign. P may be made up of an even /odd pair of registers (e.g., in 
the IBM S/390 architecture). 

P\ A* and B' represent the magnitude of P, A, and B, respectively. 
Also, s{X) is the sign of X (note that an attempt to find B should not be made if 
A is equal to zero) and V stands for unknown. 

P- A*B 
B' = P'/A! 

case 1: (s(?) = 1) AND (s(A = 1) =» 5(B) - I 
case 2: (s(?) = 0) AND (s(A = 1) => s(B) = 0 
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case 3: (s(P) - 0) AND (s(A = 0) s(B) = u 
case 4: (s(P) - 1) AND (s(A = 0) - s(B) = u 

Division: 

Q' *- Ao^/B, where 0' is the quotient 
B' = A'/Q' 

case 1: (s(Q) = 1) AND (j(A = 1) => s(B) = 1 
case 2: (s(Q) = 0) AND (s(A = 1) - 5(B) = 0 
case 3 : (s(Q) = 0) AND (s(A = 0) => *(B) = u 
case 4: (5(Q) = 1) AND (j(A = 0) - ^(B) = u 

[0030] Additionally, to facilitate the observability of data in a shared memory location, 

M, after an update operation, A <- A <op> B, stores into M, B may be chosen such that 
portions of the data retrieved by subsequent fetch instructions are "predictable". Logical 
operations may be utilized to store predictable data into M as shown below. 

A «- A AND B :(£>, = <))=> (a, = 0) 
A <= A OR B : (6 f = I) =* (a, = 1) 

[0031) In an exemplary embodiment of the present invention, the above list of logical 

and arithmetic operations may be expanded to include other operations. For example, an 
embodiment may be extended to nonlogical arithmetic and take into account underflow and 
overflow by masking the status information bits. 

[0032] When a shared location is both a source and a target, the analysis becomes 

more difficult. However, the stream may be biased so that meaningful information maybe 
determined from subsequent fetches. For example, when A (both a target and a source) is from 
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a shared iocation and B (a source) is from a non-shared location, the logical operations of AND 
and OR and use B as a mask. If the data stored by this update instruction is observed by any 
CPU, the mask is used to assure if certain bits are on (for the OR operation) or off (for the 
AND operation). 

[0033] An embodiment of the present invention also determines if instruction fetching, 

from shared locations, obeys specific block concurrence rules as described by the architecture. 
For example, in the IBM/390 architecture, instruction fetching must be half word block 
concurrent- If portions of the instruction stream belong to the shared memory, it is possible that 
some of the CPUs will alter the instruction stream by executing store or update type instructions. 
Furthermore, the fact that one cannot observe the fetched instructions adds to the complexity of 
analyzing fetched instructions. In some system architectures (e.&, the IBM S/390), the length of 
an instruction can be two, four, or six bytes. Once a CPU stores into the stream, one or more 
instructions may get modified. As an example, storing a six-byte data into the stream can 
replace three two-byte, one two-byte and one four-byte, or one six-byte instruction(s). As a 
result of storing into the stream area, some of the executed instructions may not appear in the 
original stream of instructions since certain CPUs may have altered the stream by storing data 
into the shared locations. The combination of possible instructions and the operands used by 
these instructions may become extremely large. However, such combinations can be 
dramatically reduced by associating certain instructions with particular operands. Without losing 
generality, further reduction can also be realized by selecting, randomly, the unknown 
instructions in a given stream to a specific group of instructions. As an example, arithmetic 
instructions may be selected in one stream and logical instructions in another. To analyze the 
result (if any) of the fetched instruction, an embodiment of the present invention attempts to 
determine, or guess, the instruction that would produce the result and then, if such instruction 
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does not belong to the original instruction stream, whether the instruction was inserted into the 
stream by one of the CPUs. The problem then becomes similar to those described previously 
except that at this time the fetched data, which is an instruction, was guessed rather than 
observed. 

[0034] An example of how this is may be accomplished follows. Let S cpui and A (S cp J 

be the stream of instructions and their addresses, respectively, that CPU/ is executing as 
described in the table below. 

0000 R 2 <=R 2 + R 3 . 0010 0002 *= li^l 

0002 R 2 -R 2 + Rj 

0004 R 2 <=R 2 + R 2 

For simplicity, all instructions affecting R 2 are restricted to performing only arithmetic operations. 
Let the initial values of R h R 2 , and R 3 be 1, 2, and 3, respectively. In addition, it is assumed that 
R 2 was observed to have the values of 5, 4, and 8. Along with the original stream generated for 
CPU/, the list of data stored into the stream by CPU 2 is available. In this example, such a list 
includes IF21 which is equivalent to R 2 <=R 2 -Rj. Based on the instruction stream generated 
for CPU /S the instruction inserted into that stream by CPU 2 , and the different values contained 
by R 2t the algorithm checks to see if the executed instructions were fetched properly. The initial 
value contained by R 2 was 2. The second value, 5, can be guessed to be obtained as a result of 
the add instruction at 0002. It can be assumed that R 2 <=*R 2 + Rj was not executed since such 
an operation would produce a result of 6. Therefore, guessing that a subtract operation may 
have been performed by CPU/, the algorithm checks the list of data inserted into the stream by 
CPU 2 . Such data confirms the guessing that a subtract operation should be performed if the 
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inserted instruction is executed. A comparison of the result that could be produced by the 
inserted instruction and the value contained in R 2 assure that such an instruction was fetched and 
executed properly The above analysis maybe extended to the four and six-byte long 
instructions. It is important to emphasize, that instructions whose lengths are more than two- 
bytes can result due to different store/update operations of different CPUs. Therefore, the 
analysis should include investigating if such instructions were due to the actions of a single or 
different CPUs. 

[0035] Additionally, locating operands and/or instructions, addressed by the instructions 

of a stream, may require the use of a translation process before storing into or retching from 
implicitly specified locations; The translation process of the IBM S/390 system architecture, for 
example, involves using tables such as the segment-table and page-table. Similar to the 
instruction stream case, a translation table designated for a given CPU,- may belong to the shared 
location. Hence, under certain conditions, one or more CPUs may manage to modify entries of 
CPU/s translation table. As a result, some data will be fetched from or stored into different 
addresses from those specified by the stream that CPU/ initially started executing. Testing th 
validity of the data fetched from translation tables when CPUs have the capability of altering 
certain translation table entries can basically be performed as described previously. The 
process of testing the validity of such data includes investigating whether the modified entries of 
the translation table are fetched according to the block concurrence rules. 

Execution Sequence and Multiple Shared Locations 

[0036] In an exemplary embodiment of the present invention, verification of the machine 

compliance to the execution order is based on the fact that all stores (and all fetches) must be 
executed in order. Once CPU/ observes data stored by CPU,-, an exemplary embodiment of the 



POU920020125USL 



21 



JL-U TO W-a O 33 I.O 



present invention deletes from the observable lists of CPU, any data stored by CPU/ prior to this 
store operation. For each CPU that fetches data from shared-locations, copies of all lists of 
data stored into the shared-locations and the sequence of store operations of each CPU are 
created. Once CPU,- fetches data from a shared-location and CPU; is identified to be the 
processor that stored the data, the lists of stored data by CPU y (which can be observed by 
CPU,) may be reduced. The fact that the sequence of all store operations of CPU; is known 
allows a reduction in more than one list (i.e., at different memory locations) of data stored by 
CPU,, 

[00371 For example, let M h M 2y and M 3 be three different shared-memory locations in 

which CPU;, CPU 2 , and CPUj stored the following data. 

M } 

CPU; CPU 7 CPU, 
a b c 
g e f 
d i j 





A& 






Mj 




CPU, 


CPU; 


CPU, 


CPUy 


CPU 7 


CPU, 


a 


X 


r 


c 


y 


X 


b 


e 


f 


d 


X 


y 


s 


n 


j 


n 


p 


s 
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In addition, the example includes a sequence of store operations of CPU> as follows: 

M 2 *= x 

M 3 ~x 
M 2 - e 
M 2 «= n 
A/,-e 
M, «- p 
M,-i 

[0038J If CPU, observes "p" from M h more than one list of data stored by CPU 2 may 

be reduced after verifying that "p" was stored by CPU? alone (i.e., is unique and valid data). An 
exemplary embodiment of the present invention reduces the lists of data stores in the following 
manner by taking into account that observing the "p" implies that all stores of CPU 2 prior to *'p" 
must have happened. After "p" is observed, CPU, will not observe "y" and V fromM,. It is 
also known that M t «= b and M, «- c (by CPU?) took place before M 3 <= p. Therefore, the 
value "b M (written by CPU 2 ) is overwritten by M / e. In addition, after "p" is observed, CPU, 
will never observe "x" and "e" from M 2 because the CPU 2 operation of M 2 «- n happened 
before the M, «- p operation. An exemplary embodiment of the present invention reduces the 
lists of data stored into M h M 2 and M s after ■ l p" is observed as follows. 
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Mi 

CPU , CPU, CPU, 
a e c 

g . i f 

d j 



M 2 

CPU , CPU, CPU, 

a n r 

b f 

s j 



Mj 

CPU ; CPU, CPU; 

c p X 

d y 

n s 



[0039] In addition, in an exemplary embodiment of the present invention, verification of 

the machine'compliance to the left to right order is demonstrated by using the move character 
(MVC) operation, as an example operation. The MVC operation moves a number of 
consecutive blocks from one memory location to another. For example, let each of M h M 2 , and 
A/j be one block. If CPU 2 executes a MVC that stores into these blocks, and M s is determined 
to contain data from that MVC operation, Mj and M 2 should not contain any data of previous 
stores from CPU 2 . M ( and M 2 should have the data stored by the MVC operation or any other 
store operation that may take place after the MVC. For example, M h M 2 and M 3 are three 
different shared-memory locations into which CPU/, CPU 2 and CPU? store the following data. 

Mi 

CPU , CPU, CPU, 
a b c 
e f 



CPU/ CPU, CPU, 
a x r 
b e f 



CPU . CPU, CPU, 
c y z 

P 



y 

s 
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[0040] In addition, the example includes a single instruction performed by CPU;, 

MVC: Mi M 2 M 3 *= "eex", that causes three different stores: 

Mi «= e 
M 2 «- e 

[0041 ] Finally, the example assumes that the sequence of store operations of CPU 7 is as 

follows. . 

Mi «- b 

M 2 -x 
M s ~y 
M 3 <=p 
A/,-e 
A/v*=n 

M t *- e 
A// *= i 

[0042] If CPU, fetches "x" from M 3 , then an exemplary embodiment of the present 

invention reduces the lists of CPU 2 observable data in the following manner. The list of CPU 2 
observable data from M 3 is reduced by deleting y* ™ d "P" The list of CPU; observable data 
from M 2 cannot contain "x" anymore and the list of CPU 2 observable data from M f cannot 
contain "b" anymore. The reduced lists are as shown in the following table. 
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Mi M2 Mj 

cpu , cpu, cpu, cpu , cpu? cpu ? CPU/ cm CPU; 

aec aer cx z 

if b n f y 

s j s . 

Testing Machine Compliance to Serialization Rules 

[0043J Although a processor may pre-fetch data for performance purposes, the 

processor is expected to discard any pre-fetched data when serialization, which is usually 
bigerred by the execution of certain instructions, is performed. To test if a processor CPU 6 
obeys the serialization rules, a combination of CPU fl fetches and the stores of CPU A , maybe 
used. To make a meaningful conclusion about CPU^'s compliance to the serialization rules, it 
must be known that the data fetched by CPU fl from M is not the last store of CPU 6 into M. The 
exemplary serialization verification algorithm shown in FIG. 3 attempts to eliminate the stores of 
CPU 0 that should not be observed by CPU 6 , and vice versa. The algorithm takes advantage of 
instructions known to cause serializations. For each. CPU, it is assumed that lists of all its stores 
into and fetches from the shared locations are available. 

[0044J If CPU fl fetches data stored by CPU* and it is determined that the data could 

have been overwritten by another store of CPU 6 , a correlation can be made between some of 
the processors 1 streams. Ser it CPUj denotes the serialization point that precedes the CPU fl fetch 
and Ser J C?X]b denotes the serialization point that follows the closest store of CPU* that 
potentially could overwrite the data. When CPU 0 fetches data stored by CPXJ b , a cross 
reference point (CRP), which is a pair of serialization points Ser i CrVa and Ser J C9Ub is created to 
indicate that, after &r f - , CPU a observed that CPU* did not execute/reach its serialization 
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instruction at Ser^ A subset of CPU, fetches known to be stored only by CPU^ is selected from 
the set containing CPU a fetches. A CRP is created for CPU fl if a store into a memory location 
by CPU* is fetched by CPU„ and it is determined that there is at least another store of CPU* that 
could potentially overwrite the data. For a given serialization point, at most one CRP can be 
defined. Tf a new CRP = (Ser i C?Vai &ry tCPUfr ) is to be created for CPU a , a check is made to 
determine if CPU fl already has another CRP = (Ser^ CPUfl , Ser* c ?vb)- If one already exists, the 
most restrictive CRP is kept. In other words if k is less than j, the new CRP replaces the 
previous one. 

[0045J Referring to an exemplary embodiment of an algorithm for testing machine 

compliance to serialization rules as depicted in FIG. 3, the terms used in FIG. 3 are defined 
below: 

Fcpm = f cpui(W = List of CPU* fetches from M\ 
St CPUi = List of stores of CPU;; 
S*cpuAM) = List of stores °f CPU i int0 M\ 

OWcpuiiM) - store operation of CPU i that can overwrite the observed data 
from M\ and 

S(Scm, cpui) = list of CPU/ stores that took place after Ser n and all accumulated stores up 
to &r„. 

[00461 At step 302 in FIG. 3, a loop is initiated for each pair of CPU's, referred to as 

each pair of CPU fl and CPU A . The instructions labeled 304 initiate the building of CRP's for 
CPU^ and the checking of CPU 6 fetches against CPU fl stores. This is accomplished by invoking 
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the instructions denoted 308 to create CRPs for CPU„ (because "y" has been set to "b"). For 
each CPU, fetch (because "x" has been set to "a") of data stored by CPU 4 into M, if other 
stores of CPU A that could overwrite M did not take place, a CRP is created or updated. The 
instructions labeled 310 are also invoked by step 304 to check the CPU 4 fetches against the 
CPU U stores and the CRPs. These instructions are utilized to determine if CPU 6 violated 
serialization rules. At step 306, the values for "b" and "y" are swapped and the reverse 
checking takes place: CRP's for CPU fl are built and CPU„ fetches are checked against the 
CPU fc stores. The algorithm depicted in FIG. 3 is performed for each pair of CPU's that utilize 
the shared memory location. 

[0047J An example of the processing performed by the algorithm depicted in FIG. 3 is 

described below and assumes store and fetch instructions as shown in the following table. The 
numbers represent the order of each individual instruction in the stream. The "$" denotes a 
serialization point. 
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CPU , CPLb 

2 M,~A, 4 M 2 -B, 

8 M,-A 2 7 M 2 -.B 2 

9 $ 12 $ 
16 M, —A } 20 

18 M,-A 4 22 M 2 ~5< 

20 $ 24 $ 

23 flj-M, 25 /? 2 <-M, (fetch A 2 ) 

24 30 M2-5 5 

28 R,~M 2 (fetch £,) 32 R 4 —M, (fetch /4j) 

35 M, y4 6 38 M 2 ~B 6 

40 $ 50 $ 

10049] After CPU, observes B, a CRP = (20,12) is created to signify that all CPU 2 

fetches after the serialization point at 12 can observe only the latest copies of all stores prior to 
the CPU, serialization at 20. As a result A,, A 2 and A 3 cannot be observed by any fetch 
instruction of CPU 2 that follows in sequence the serialization instruction at 12. Any CPU 2 fetch 
that retrieves these data would indicate violation in serialization rules. As an example, the CPU 2 
fetch at 32 is valid, as far as the serialization is concerned. On the other hand, the fetch at 25 
indicates that CPU 2 is using a pre-fetched data and failed to discard it after it serialized at 1 2. 
FIG. 4 is a block diagram of an exemplary system for testing the validity of shared data in a 
multiprocessing system. It includes several CPUs 404 that have access to the same shared 
memory location 402. Any type of CPU known in the art may be utilized in an exemplary 
embodiment of the present invention along with any type of memory capable of being shared by 
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two or more processors. In an exemplar embodiment the CPUs 404 have an IBMS/390 
architecture and the memory 402 is a direct access storage device (DASD). The example 
depicted in FIG. 4 has "n" CPUs accessing the shared memory location 404, where "n" may be 
any integer higher than one. In addition, each of the C PUs 404 include computer instructions to 
test the validity of shared data in a multiprocessing system. 

[0050] An embodiment of the present invention provides for testing data block 

concurrency, execution sequence and serialization in multiprocessing systems. Each CPU in the 
multiprocessing system may store random data into shared memory locations. The method 
allows one or more CPUs to store identical data into the same memory location. In addition, an 
embodiment of the present invention tests the validity of a data fetch by a given CPU under a 
pipe-lines and dynamic environment. The method is applicable to any size of block concurrent 
update. Instruction fetching of the IBM System 390 architecture, for example, is half-word 
block concurrent while operands of arithmetic and logical operations are word or double-word 
or more (e.g., quad-word) block concurrent. Additionally, an embodiment of the present 
invention takes into account stores and fetches caused by arithmetic and local operations 
engaging in register-to-memory and memory-to-memory data transfer operations. An 
embodiment of the present invention also handles the case where an instruction stream and/or 
translation table are subject to modifications due to the actions of some of the CPUs. 

[0051 J As described above, the embodiments of the invention maybe embodied in the 

form of computer-implemented processes and apparatuses for practicing those processes. 
Embodiments of the invention may also be embodied in the form of computer program code 
containing instructions embodied in tangible media, such as floppy diskettes, CD-ROMs, hard 
drives, or any other computer-readable storage medium, wherein, when the computer program 
code is loaded into and executed by a computer, the computer becomes an apparatus for 
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practicing the invention. An embodiment of the present invention can also be embodied in the 
form of computer program code, for example, whether stored in a storage medium, loaded into 
and/or executed by a computer, or transmitted over some transmission medium, such as over 
electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, vyhen 
the computer program code is loaded into and executed by a computer, the computer becomes 
an apparatus for practicing the invention. When implemented on a general-purpose 
microprocessor, the computer program code segments configure the microprocessor to create 
specific logic circuits. 

[0052] While the invention has been described with reference to exemplary 

embodiments, it will be understood by those skilled in the art that various changes may be made 
and equivalents may be substituted for elements thereof without departing from the scope of the 
invention. In addition, many modifications may be made to adapt a particular situation or 
material to the teachings of the invention without departing from the essential scope thereof. 
Therefore, it is intended that the invention not be limited to the particular embodiment disclosed 
as the best mode contemplated for carrying out this invention, but that the invention will include 
all embodiments falling within the scope of the appended claims. Moreover, the use of the terms 
first, second, etc. do not denote any order or importance, but rather the terms first, second, etc. 
are used to distinguish one element from another. 
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